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The principle of common cause is discussed as a possible fundamental prin- 
ciple of physics. Some revisions of Reichenbach's formulation of the principle 
are given, which lead to a version given by Bell. Various similar forms are com- 
l/" - ) . pared and some equivalence results proved. The further problems of causality 

O | in a quantal system, and indeterministic causal structure, are addressed, with 

a view to defining a causality principle applicable to quantum gravity. 
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1 Introduction 



X. 

In the search for new and more general theories of nature, it is of interest to ask 
which physical principles will survive in the next fundamental theory, and which will 
be only approximately true. Candidate answers have been of use in the formulation 
of theories in the past, and one might hope that they may be again, for instance in 
quantum gravity. Most directly, given a kinematical framework, physical principles 
can be used to constrain the dynamics until only a small class of theories remain (an 
example being the derivation of general relativity from the principle of equivalence, 
general covariance, etc.) In the causal set quantum gravity program, which is based 
on a simple kinematical structure, this approach is particularly natural, and has 
already been used to formulate a stochastic dynamics for causal sets (Rideout & 
Sorkin, 2000). 
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As superluminal signalling (or more objectively, superluminal influence) is widely 
held to be impossible in current theories, and moreover seemingly impossible to square 
with relativity, a condition based on this would be a strong candidate for a funda- 
mental principle (Dowker, Henson, & Sorkin, ). What form would such a principle 
take? It would be desirable to avoid two things: subjective statements involving ob- 
servers, and controversies in the philosophy of causation. Therefore, so far, the causal 
principles of most interest to physics have been those that give conditions, in terms 
of probabilities, that are meant to be physically reasonable without the need for one 
agreed definition of causes, effects and so on. These conditions, and their names, are 
many and various; screening off, the Reichenbach principle of common cause (PCC) 
(Reichenbach, 1956), local causality (Bell, 1987, pp. 52-66), and stochastic Einstein 
locality (SEL) (Hellman, 1982) are most widely used. Uffink's (1999) provides a good 
introduction to the PCC, raises some of the questions that I attempt to deal with 
below, and criticises other forms of the principle. 

The seeming variety of formulations might be taken as speaking against causal- 
ity as a fundamental principle. However, as the conditions are clarified, generalised 
and otherwise revised, they tend either to fall victim to paradox, or to converge to 
equivalence. An example of this is given in the first section of this article, where 
Reichenbach's PCC becomes a statement resembling Bell's version of screening off 
after a few well motivated revisions (Butterfield's (1994) includes a related discussion 
of SEL). In the same section, screening off is seen to be immune to certain paradoxes 
that afflict other PCC-like principles. Various other forms of the principle are com- 
pared and some claims of equivalence proved. An argument from (Dowker et al., ) is 
also touched on below: that even "weak relativistic causality" , a very weak "common 
cause" condition, is equivalent to screening off, if taken to be true when probabilities 
are conditioned on past events. 

All this would not be surprising - if quantum mechanics did not violate screening 
off. As it is, all of the stochastic definitions of causality fall down here. But still, 
quantum mechanics does not allow superluminal signalling, leading many to think 
that there really is no superluminal influence, i. e. that some principle of relativistic 
causality should still hold here. If this attitude is taken, then there is some assump- 
tion that has been used so far which needs to be dropped. One such assumption is 
this: that the framework of stochastic processes is a sufficiently general one to de- 
scribe our physical theories, containing all relevant information about the system in 
question, in particular everything that could possibly be relevant to causality. Recent 
developments in quantum mechanics cast doubt on this. Quantum mechanics can be 
described as quantum measure theory (Sorkin, 1994, 1995; Martin, O'Connor, and 
Sorkin, 2004), a generalisation of probability measure theory, giving rise to the idea 
of a quantal process as a more fundamental framework than the stochastic process. In 
(Dowker et al., ) a candidate principle called quantum screening off is derived using 
this line of thought, and shown to be obeyed by local relativistic QFT. In Section 3 
below, quantum screening off is reviewed and two forms compared, in close analogy 
to the previous section's discussion of stochastic screening off. 
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If some kind of PCC can indeed survive quantum mechanics, there is another 
hurdle to jump before it could be applied to any quantum gravity theory: indeter- 
ministic causal structure. Even in the stochastic framework, the problems involved 
in defining a causal principle for "background independent" theories are severe. Rec- 
onciling causality with general covariance, and finally moving to the quantal process 
framework, present further difficulties. These are expounded at the end of section 3. 



2 From Reichenbach to Bell 

2.1 Reichenbach 's principle and two generalisations 

The first attempt to formulate the PCC is given in terms of the probabilities of certain 
events. Not surprisingly, then, we are helped considerably by a good definition of 
events. If we call the space of all possible histories of the system in question f2, 
then an event is a set in an appropriate cr-algebra E of subsets of Q. All this means 
is that if {Ai} is a countable set of events, then [J i A i and A°, the complement 
of Ai, should also be events. The thought behind this is that, if we have defined 
the events "the temperature was below x in Dublin", then we must also be able to 
define events like "The temperature was above x and below y in Dublin", and so 
on. This piece of formalism helps to solve one of the major problems with the PCC, 
"Bernstein's paradox", later on. We also have the probability measure fi : £ — > R 
obeying the usual axioms. In this language, what might usually be called P(AhB) 
becomes /j,(A n B), P{-*A) is ^(A c ), etc. As usual, conditional probabilities are 
defined by fx(A\B) = fj,(Ar\B)/fx(B). A partition of the space f2 is an exhaustive list 
of alternative events, i.e. a disjoint set of events $ such that |J Ag4> A = fi. 

A sensible definition of dependence of events for the purpose of defining the PCC 

is 

^(B)^(AnB). (1) 

Eqn.(l) says that fi(B\A) ^ ^(B), i.e. that if we are given A, then that changes the 
probability we attribute to B (and vice-versa) . 

Consider the following situation. Two illusionists, one in Athens and the other in 
Brussels, each toss a coin at the same time. The event A of the coin in Athens coming 
up heads is correlated to the event B of the coin in Brussels coming up heads. Are 
we astounded by this feat? Not if the illusionists had previously met and randomly 
selected one of 2 pairs of biased coins to be used. The correlation is nothing out of the 
ordinary if, after conditioning our probabilities on which pair of coins was selected, 
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the correlation disappears. For instance, 



fx(A) — 0.5, //(B) = 0.5, /i(AnB) = 0, 
H(A\C) = 1, n(B\C) = 0, n(AnB\C) = 0, 
fi{A\C c ) = 0, fi(B\C c ) = 1, /i(A n B|C C ) = 0, 
^(C) = 0.5, 



(2) 
(3) 
(4) 
(5) 



where C is the event of selecting the first pair of coins. 

Considering examples similar to this, Reichenbach proposed the following principle 
for positively correlated events. 

The original PCC: if A and B cannot be causally connected, 1 and fi(A U B) > 
n{A)n(B), then there exists an event C such that 



As noted by Uffink (1999), conditions (8,9), and the requirement of positive cor- 
relation are really just window dressing, expressing the idea that C is a "cause", and 
that a cause is usually taken to mean something that makes an "effect" more likely 
to happen. But if the aim of the PCC is to demand an explanation of certain cor- 
relations, the negative correlations need explaining too, and the explanation by (6,7) 
works even if (8,9) are not true, for instance in the above example. Therefore C does 
not have to be a "cause" in the sense that it makes A and B more likely, and this 
word will be avoided in favour of more neutral terms. 

First revision of the PCC: if A and B cannot be causally connected and (1) is 
true, then there exists an event C such that (6) and (7) are satisfied. 

There is another fairly obvious generalisation. What if our illusionists were to 
choose between more than two pairs of coins? This situation could well violate the 
first revision of the PCC, as now (7) may fail to hold. But this is no more amazing 
than their first plan; there is nothing physically unreasonable about the correlation. 
We need another principle. 

1 In Rcichcnbach's definition, he referred to A and B as being simultaneous. This class of events 
is very limited in non-relativistic physics and undefined in relativistic theories, and therefore not 
very useful. The phrase "cannot be causally connected" is to be clarified below. 



(j,(A n B\C) = fi(A\C)fi(B\C) 



(6) 
(7) 
(8) 
(9) 
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Second revision of the PCC: if A and B cannot be causally connected and (1) 
is true, then there is a partition $ of Q such that 

li(AnB\C) = n(A\C) f ji(B\C) VCG$. (10) 

This definition is weaker than the previous one since {C, C c } is a partition of Q. It 
has previously been proposed for different reasons (Uflink, 1999). 

Motivated by this, a screening event is defined as an event C such that (1) and 
(6) hold for some A and B; the definition is only a casual one and might be extended 
to events such that (1) and (6) are true when the probabilities are conditioned on 
some past event. 

2.2 Simpson's paradox and the necessity of a spatiotemporal 
causal structure 

What correlations are in need of explanation? The vague formulation "could not 
be causally connected" needs clarification. One answer is that events happen in 
spacetime, and spacetime has a causal structure. (Care must be taken here with 
nomenclature: causal relations and causal structure will be used in the physicists' 
manner as describing the light-cone structure of spacetime, and are not directly re- 
lated with any favourite philosophical notion of causation). Events that occur in 
spacelike (i.e. causally unrelated) regions should be the ones required to obey the 
PCC. This gives a rigorous concept to replace "could not be causally connected". For 
everyday use we might make an effective definition of causal structure: for instance, 
the coin tosses in the previous example could be considered as occurring in effectively 
causally unrelated regions. There are all kinds of possible generalisations. All that 
is required is a "spatiotemporal" partial order S in which events occur, e.g. a weakly 
causal Lorentzian manifold 2 or a causal set. More arguments will be presented in 
favour of this later on. 

Using this definition requires some caution: it amounts to an acceptance that 
"causal influences" cannot propagate outside of the light-cone. If the PCC were to 
fail using this definition of causal structure, it leaves the possibility that it is not 
Reichenbach's PCC, but the relativistic principle, that has failed — a view embraced 
in pilot wave theory ("Bohmian mechanics"). 

Subsets of S, or "regions", will be denoted by calligraphic typeface. As in relativity 
the past of a point x G § (i.e. the set of all points in § that are less than x in the 
causal partial order, including x itself) is called J~(x), and J~(X) = [J xe x J~( x )- 
The future set J + (X) is defined similarly. The notation X\y will be used to indicate 
that the regions X and y are spacelike, i.e. that J~(X) ny = and J~(y) C\X — 0. 

2 A weakly causal Lorentzian manifold is one that contains no closed causal curves. 
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2.2.1 Domains of decidability 

Now the concept of the region in which an event occurs has become important. For 
every event A, there is assumed to be a unique smallest region dom(A), such that 
knowing all the properties of the history in dom(A) enables us to decide (without 
further knowledge of /x) whether A occurred or not, called the least domain of decid- 
ability. This is not the only sensible prescription that could be made. For instance, if 
the history space Q was made up of solutions of the Maxwell equations, there could 
be several such regions which did not overlap. If A was a set of histories in a which 
beam of light passes through a region A (a well enough defined event in the classical 
theory), then given the field in A we could decide whether A occurred or not. But we 
could also do so given the field in a thickened spacelike slice of J~(A)\A. However, 
the requirement that there be a single domain of decidability for each event is not 
restrictive. The above situation could be easily represented by making the history 
space f2 the space of all possible configurations of the electromagnetic field, and sim- 
ply setting the probability of the set of non-solutions to be 0. In this larger history 
space, knowing all the properties of a history within one region A decides no event 
that occurs outside A, without knowledge of /i. 

Again, some definitions will help. It will not be necessary in this paper to con- 
struct any framework dealing with properties of histories, their relation to events, and 
their locations in the spacetime structure; it is enough to assign a least domain of 
decidability to each event by fiat, with some rules for consistency. The least domain 
of decidability is defined as a function dom : E — > pow(§) (where pow(§) is the set 
of all subsets of S) with the following properties: 

For all countable subsets A of E, 

(i) dom(X) n dom(F ) = VX, Y G A such that X ^ Y 
=>- dom(n xeA A) = U XeA dom(X). 

(ii) dom(X) = dom(F) VX, Y G A =>• dom(n xeA X ) c dom(F) V Y G A. 

(iii) dom(X c ) = dom(X) VX G E. 

(iv) \/Z G E s.t. dom(Z) = X U y, Z is a member of the <7-algebra generated by 
T(X) U T(y), where T(X) = {X G E : dom(X) C X}. 
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Here U indicates disjoint union 3 . Properties (i-iii) are fairly intuitive, while (iv) 
ensures some "locality" in the decidability of events: it says that any statement about 
events in the region X U y is a logical combination of statements about events in X 
and y (this assumption is considered in more detail later). It will be said that "X 
occurs in X n if dom(X) C X. One immediate consequence of (i) and (iii) is: 

For all countable subsets A of E, 

dom(X) n dom(F) = VX, Y G A such that X 

dom(lJ XgA X) = |J XgA dom(X). Similarly, (ii) and (iii) give: 

For all countable subsets A of E, 

dom(X) = dom(F) VX, Y G A =>> dom(U XeA X) C dom(F) V Y G A. 

It is also useful to note that dom(0) = dom(fi) = for any suitably nontrivial E 
and dom. This can be seen by noting that, by (ii) and (iii), dom(0) = dom(X c nX) C 
dom(X) for all events X, so as long as there are two events with disjoint domains of 
decidability, the only possibility is dom(0) = 0. This weak assumption will be made 
throughout the following. 

If some knowledge is gained about events in some region, what do we know about 
events in a subset of that region? If X C y — dom(F), then the restriction X = 
Tx(Y) of Y to X is defined as the intersection of all events Z such that Y C Z and 
dom(Z) C X. This restriction represents the most specific event decidable in X given 
byY.' 

A full specification of a region 1Z is defined to be a non-empty event F such that 
dom(F) C 1Z and 

dom(X) C K =>■ F C X or F C X c VX G E, (11) 

i.e. "given F, all events that are decidable within 1Z are also decided". Let $(7£) be 
the set of all full specifications of 1Z. The following lemmas are useful. 

This lemma states that if everything that happens in two regions is known, then 
everything that happens in the union of these regions is known. 

3 These provisional definitions may need modification. It is not immediately clear that the count- 
able unions in (i,ii) are strong enough. For instance, if dom("The scalar field holds value x at point 
X") is the point X, then dom("The scalar field held value x throughout region 1Z") should be 1Z. 
The question of whether this follows from (i-iv) is left unanswered for now as the definitions given 
are sufficient for the purposes of this article. 
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Lemma 1 If A and B are disjoint regions and A,B are full specifications of them 
respectively, then An B is a full specification of AuB. 

Proof: Take an event Z such that dom(Z) C A\J B. From property (iv) of dom, Z 
is in the cr-algebra generated by T(A) U T(B), being defined as in the definition 

of property (iv). From the definition of a full specification, we have that either A E X 
oricT for all X E T(A), and either B C Y or B C Y c for all Y E T(B). It 
follows that A n B E W or A n B C W c for all W E T(A) U T(B), and so, by the 
properties of a a-algebra, ACiBeWotACiBe W c for all W that are members of 
the a-algebra generated by T(A) UT(B). This implies that AC\B E Z or AC\B E Z c . 
So dom(Z) C*4.U£>=^Ari-E>C Z WZ E S, and the definition of a full specification 
is satisfied. □ 

The next lemma states that if everything in a region is known, then everything 
in a subset of that region is known. 

Lemma 2 Given a full specification A of A and a set B E A, B = T^(A) is a full 
specification of B. 

Proof: Take X such that dom(X) C B. From property (iii) of dom we also know 
that dom(X c ) C B. Since B E A, and A is a full specification of A this implies that 
either A E X or A E X c . But from the definition of restriction, B is a subset of all 
sets Z such that A E Z and dom(Z) C B, so either B E X or B E X c . This proves 
that dom(X) C B =>• B E X or B E X c VX E S. □ 

Corollary 1 = for some finite set of regions A-i, then a full specification 

FoflZ can be written F = P| i Ai where Ai is a full specification of At. 

Lemma 3 &(7Z), for any region 1Z, is a partition ofVL. 

Proof: Every singleton set / in Q is a full specification of § because it has no subsets 
4 . By lemma 2 every such / is in a full specification of 71, defined by r n (f). No 
singleton can be in more than one full specification of 1Z, so &(1Z) is a disjoint set. □ 

2.2.2 Avoiding Simpson's paradox: Bell's PCC 

The PCC as it stands looks weak enough; we have prevented the illusionists from 
fooling us into calling a physically reasonable correlation "magic". But what if two 
wizards were aiming to do the opposite: fool us into believing that their magic was 

4 It has been assumed here that f2 can be discribed as a union of singletons. 
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reasonable? They could try a similar trick to the illusionists. They are possessed 
of two magical pairs of coins which produce correlated outcomes with no common 
cause. Before setting off to perform their magic, they randomly select one of the two 
pairs. For instance (taken from Uffink (1999)): 



/i(A) — 0.5, //(£)= 0.5, n(A n B) = 0.25, (12) 

H(A\S) = 0.5, n(B\S) = 0.5, //(^4 n B\S) = 0, (13) 

fi{A\S c ) = 0.5, /i(fi|5 c ) = 0.5, n(A n 5|S C ) = 0.5, (14) 

MS) = 0.5, (15) 



where A and _B are as in the previous example, as S is the event of choosing the first 
pair of magic coins. This situation is not one that we would want our condition to 
allow. Yet, without taking S into account, it looks like there is no correlation, and 
the common cause principle is not violated. This is known as Simpson's paradox. An 
event like S is called a Simpson event: an event S such that 

n(AriB)=n(A)n(B), n(AriB\S)±n(A\S)n(B\S). (16) 

It is in a sense the opposite of a screening event. 

The common cause principle as it stands looks too weak in the light of this. 
It needs to be strengthened, so that it is still valid even if the probabilities are 
conditioned on events to the past of A and B. But what is meant by "the past of A 
and B" without the explicit introduction of a causal structure? If there is no such 
structure, then Simpson's paradox is hard to avoid, as Uffink (1999) points out. With 
the causal structure, a common cause principle can be formulated which seems well 
able to avoid the paradox. The result is: 

Screening Off (SOI): For all events A and B with dom(A) C A and dom(_B) C B, 
if A \ B, then 

n(Ar\B\C) = n{A\C)n{B\C) VC e $('P 1 ), (17) 
where V\ is the mutual past V\ = J~(A) fl J~(B). See fig.(l). 

If the screening factors C are full specifications of the past, then there is no room 
for a Simpson event in the past to reintroduce correlations. Neither does the new 
definition seem too strong: we would expect, after conditioning on all events to the 
past, that two events in spacelike regions would be independent. Thus through 3 
strongly motivated modifications of Reichenbach's original principle we have reached 
some agreement with Bell's definition of causality (Bell, 1987, pp. 52-66) (in which 
he named a similar "screening off" principle "local causality"). 

Some justification should be given for requiring (17) for any regions A and B such 
that dom(A) C A and dom(5) C B, rather than only those such that dom(A) = A 
and dom(_B) = B. This distinction is not as great as it might appear. Consider a 
countable set of events {Ai} such that A = | |^ A h and let dom(Aj) = A for all Aj. 
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Figure 1: A "spacetime diagram" of the regions involved in screening off (making S look 
like Minkowski space) . Two spacelike regions A and B are shown along with their past light 
cones. V\i shown in grey, is the mutual past of A and B. 

From properties (ii) and (iii) of dom, the region dom(A) is contained in A (see the 
immediate consequnces of the properties of dom). Requiring SOI, with this A, holds 
for each Aj, with B implies that it must also be true for A: 

t i{A i nB\C)=n{A i \0)^B\C) VC7g$(P 1 ), (18) 
=► //((□A)n J B|C) =//«□ A)|CV(5|C) VCe*(Pi), (19) 

i i 

so the definition requiring that dom(A) = A, when applied to all the events At, implies 
the definition requiring dom(A) C A in this case. Such a case can be constructed 
for any A D dom(A) as long as there are events with A as their least domain of 
decidability. 

2.2.3 Which past? 

No justification of the use of the mutual past has been given above. The following 
principle seems just as well motivated: 

A Second Screening Off (S02): For all events A and B with dom(A) C A and 
dom(5) C B, if A \\ B, then 

fi(Af]B\C) = fi(A\C)fi(B\C) VCe$(V 2 ), (20) 

where V 2 is the joint past V 2 = J~(A) U J~(B)\(A U B). 

Which of these is the most physically reasonable? If SOI is accepted, might 
Simpson events occur in V 2 but outside of V\l In fact, this cannot happen. It is ruled 
out when SOI is applied to pairs of events other than the pair {A, B} in question. The 
two definitions turn out to be equivalent, at least under a simplifying assumption. 
The assumption is that there are only a finite number of full specifications of any 
region. 

To see this we will need the following lemma in probability: 



10 



Lemma 4 If 



vx g $(x),y e $(y), 

H{A n Y)n(B n X) = n(A n 5 n X n Y) , and 
fi(AnY)fi(X) = fi(AHX HY) , and 

fi(Y)fi(B ni) = fj,(B niny) , and 
/iOHMX) = /i(x n Y) 



(21) 
(22) 
(23) 
(24) 



then 



fx(A\X r\Y)fi(B\X HY) = fi(AC]B\X f]Y) vx G $(X), Y e $(y). (25) 
Proof: (21) and (24) imply 

/jl(A n y)h(b n x)n(x)n(Y) =/i(Anfinin y V(x n y) vie &(x), Y g 



/x(Anxny)/i(finxny) = //(An^nxny^xny) vx e $(x), y e $(y), (27) 



which is equivalent to (25). □ 
Claim 1 SOI implies S02. 

Proof: Assume SOI. Take any pair of events A and B, such that dom(A) C A 
and dom(5) C B and A \ B. As above, Vi = J' (A) n J~{B) and V 2 = J' (A) U 
J-(B)\(AU B), and it is also useful to define X = (J~ (A)\A) \ J~ (B) and y = 
(J~(B)\B)\J~(A). SOI will be applied to other pairs of events in order to show 
that S02 also holds. 

Consider the pairs of events {(A n X), (B n Y)}, {(A n X), Y}, {A, (B n Y)} and 
{X, y}, where X is a full specification of the region Af, and y is a full specification 
of y. To apply SOI to these pairs we need to establish that the members of each pair 
occur in spacelike regions, and find the mutual past of those regions. Recall that a 
full specification X e &(X) is defined so that dom(X) C X, and since A is disjoint 
to X, dom(AflX) C AUX, for all X G <&(X), from property (i) of dom (similarly for 
B and Y). It is also the case that dom(X) C AU X and dom(y) C Buy. From this 
we can see that for all four pairs of events, the two events occur in the pair of regions 
{(A U X), (B U y)} respectively. It can be seen that (A U X) \\ (B U y) , and that the 
mutual past for this pair of regions is V\. This is true for any causal structure (from 
its definition as a partial order), and is especially clear for Minkowski space, with 
reference to fig. 1. Thus SOI can be applied four times, each time conditioning on 



(26) 



Substituting from (22,23) into the LHS, we have 
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a full specification C of the past region V\. Using the notation //(•) = //(-|C), SOI 
gives: 



vx g $(#),y g $(y),ce $(Pi) 

n Y)Jl(B n X) = jl(A f] B f] X f]Y) ,and (28) 

jl(A n F)/j(X) = //(A n X n F) , and (29) 

p,(Y)jl(B n X) = jl(B nlny) , and (30) 

OTW=^ny). (3i) 



From lemma 4, this implies that 

jl(A\XC]Y)jl(B\Xr]Y) = j2(Af]B\Xf]Y) VX G F G &(y), C G $(Pi). (32) 

From corollary 1, if C G X G and F G $(F), then C n X n F G $(7 3 2 )- 

Eqn. (32) is therefore equivalent to (20) and so S02 is satisfied for the pair of events 
{A,B}. □ 

Note that the assumption (that there is only a finite number of full specifications 
of any region) was needed here in order to use probabilities like fJ>(X) (which would 
always be if the history space was continuous). That is the only reason why it is 
needed in the proof of equivalence of SOI and S02. As such the assumption could 
probably be removed, at the cost of making a more subtle use of measure theory in 
the proof of claim 1. 

Claim 2 SO 2 implies SOI. 

Proof: This claim is the easier to prove. The argument is similar to one given 
by Shimony, Home, and Clauser (1993, p. 165). As above, take any pair of events A 
and B, such that dom(A) C A and dom(B) C B and A \ B. We only need note that, 
if dom(A) C A and dom(B) C B, then it is also true that dom^) C A U X and 
dom(.B) G Buy, where X and y are defined as before. The joint past of the pair of 
regions {A U X, B U F} is V±. Since A U X \\ B U y, S02 gives 

fi{AnB\C) = fi(A\C)fi(B\C) VCe$(Vi). (33) 

From this, we know that, for any events A and B such that dom(A) C A and 
dom(_B) C £>, if A \ £>, then the above equation holds. This is exactly SOI. □ 

Corollary 2 SOI is equivalent to S02. 

Generalisations of this result may be possible. For example the following form of 
screening off is conjectured to be equivalent to SOI: 
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Generalised screening off: For all events A and B with dom(A) C A and 
dom(B) C B, if A \\ B, then 

/i(AnB\C)= /i(A\C)ii(B\C) VCe$(V'), (34) 

where V contains the mutual past V\ = J~(A) H J~(B) and does not intersect the 
regions J + (-4) or J + (£>). V could, for example, be J~(A)\A, as in Bell's original 
formulation. 



2.3 Avoiding Bernstein's paradox 

How does SOI handle correlations between multiple events? For some versions of the 
PCC, this is a fatal problem, which Uffink calls Bernstein's paradox (Uffink, 1999). 
It stems from the observation that, even if n events are each pairwise uncorrelated, 
it is not implied that there is no "mutual correlation" , i. e. 

fi(Ai n Aj) = n(A^n(Aj) e {l,2,...,n} (35) 

n n 

fi(f]A l ) = Hf,(A l ), (36) 
i=\ i=i 

where {Ai} is a set of n events. What are we to do about such mutual correlations? 
Do we need a further principle? No - SOI already covers this case. Armed with 
the definition of events, we can see that, if all the events in {Ai} occur in spacelike 
regions (i.e. if dom(Aj) \ dom(A,-) for i ^ j), then {Ai, Aj}, i ^ j are not the only 
spacelike pairs of events that are required to satisfy SOI. From property (i) of dom, 
it can be seen that for any Ac {1,2,..., n}, 

dom(f|A) = m ( 3T ) 

ieA «eA 

(where Ai = dom(Aj)), so quite a few pairs events of the form f] i&A Ai occur in 
spacelike regions. Assuming that all spacelike pairs are independent, there is no 
mutual correlation: 

n n 

Mn^) = M^i)MD^) ( 38 ) 

i=l i=2 

n 

= f ,(A 1 )^A 2 )^(f)A l ) = ... (39) 

i=3 

n 

= IIMA) (40) 
i=l 

Without presenting the argument in detail, it is claimed that considerations similar 
to those in the previous section show that the following is a consequence of SOI: 
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Screening off for multiple events: Let {A,;} be a set of n events, and let 
dom(Aj) C Ai Wi, for some set of regions {Ai}. If (36) is true and Ai \ Aj e 
{l,...,n}, % ^j, then 

n n 

Kf)A t \C) = H^A t \C) WCe^Vj), (41) 
i=i i=i 

where Pj is the joint past Vj = (J" =1 </ - (-4i)\(Ur=i anc ^ ^ = dom(Aj). This 
definition is similar to S02; some other past regions can be substituted for Vj. It is 
similar to a formulation by Uffink (1999). 

2.4 Further revisions? Causality and locality 

SOI is presented here as the most reasonable form of the common cause principle in 
stochastic theories with fixed causal structure. But there are a few ways in which it 
could be modified for certain purposes. 

At this point some terms need to be clarified. What do the terms "locality" 
and "causality" stand for, and are they really the same thing? Bell (1987) calls 
SOI "local causality". By "local" here he seems to be thinking of something like 
Einstein locality, a condition for deterministic theories which states that everything 
in a region A should be fully determined by the configuration in J~(A)\A. But 
other common uses of the word conflict with this in three ways. (1) If a field theory 
is local in the sense that its action is an integral of a local function over spacetime, it 
is not necessarily Einstein local (consider electromagnetism with c replaced by 2c, for 
example); (2) Einstein locality does not exclude the possibility that the occurrence 
of events in the "deep past" could directly influence those in the future ("temporal 
action at a distance"), since it does not state that the configuration in a region A 
should be determined by the configuration in a thickened spacelike slice of J~(A)\A; 
(3) Einstein locality does not require events to be defined locally. Einstein locality 
can be thought of as a deterministic causality, but similar problems arise in stochastic 
and quantum theories. In all these ways, then, causality is not equivalent to locality. 

The third of these differences has not been dealt with yet; property (iv) of the 
least domain of decidability effectively requires that events be defined locally. But 
this is not the only sensible definition that could be made. In some cases it is not 
even physically realistic. For example, in non-abelian gauge field theories, the gauge 
invariant "observables" cannot be defined locally. 

Removing property (iv) would allow events to be non-local in general in the sense 
that the events that occur in a union of regions may not be deducible from those in 
the individual regions. SOI looks just as reasonable with this weaker definition of 
dom. Some of the lemmas stated above would require modification if claims 1 and 2 
were to be proved in this case, however. In particular, lemma 1 would no longer be 
true in general. 



14 



Another revision might also be considered. There is a further possible definition of 
the past that has not hitherto been mentioned: Penrose and Percival (1962) suggest 
that it should be any region Vd that dissects J~(A) U J~(B) into two parts, one 
containing A and the other containing B. This definition is stronger than SOI. Is 
this strengthening justified? Problems (2) and (3) above are relevant here. If an event 
in the deep past can affect events in A, but no event in between, the Penrose-Percival 
principle may fail to take into account common causes for A and B lying in V\\Vd- 
Non-local events of the type already considered in this section could be in the past 
V\ but not in every region Vd- Thus this condition is stronger than SOI in two ways. 
If suitable locality conditions were assumed, such that both of these possibilities were 
ruled out, then it is a reasonable conjecture that Penrose and Percival's definition 
would also be implied by SOI and S02. 

Finally, some modifications to the PCC might be desirable in the context of 
cosmology (Earman, 1995, chap. 5). Initial correlations are disallowed in S02, i.e. 
screening off is required even if A or B contain a part of the initial hypersurface. 
There is no obvious reason to allow or disallow this, so the option remains open. 
Consider the following definition. 

Weakened screening off (S02w): For all events A and B with dom(A) C A and 
dom(5) C B such that A and B are of finite extent and do not contain any part of 
the initial hypersurface, if A \ B, then 

fi(AnB\C) = fi(A\C)fi(B\C) VCe$(7> 2 ), (42) 

where V 2 is the joint past V 2 = J~(A) U J~(B)\(A U B). 

Here the "initial hypersurface" should be taken to mean the set of points in the 
causal structure with no points to their past, and "finite extent" is intended to mean, 
in the case of a Lorentzian manifold, that there are no past-infinite causal curves in 
the region (no past-infinite chains in the case of a causal set, etc.). This definition 
stems from a suggestion of Rafael Sorkin's. The proof that S02 =>- SOI cannot be 
modified to show that S02w =>- SOI; at least, not without assuming that there are 
no correlations between events occurring on spacelike sections of the initial hypersur- 
face (and that the causal structure is past-finite). (It should be noted that, although 
this definition of causality may be weaker than that which is used to prove the Bell 
theorems, any attempt to evade them on this basis would require all EPRB exper- 
iments to be explained by initial spacelike correlations that are propagated toward 
the laboratory from far distant regions - a conspiratorial scenario that is unbelievable 
for most scientists). The question of whether the universe contained spacelike corre- 
lations in its earliest history is one for observation. However, a definition of causality 
which treats initial points as special may not be natural for quantum gravity. 

The big problems of quantum mechanics and dynamical causal structure also 
suggest revisions. They are given their own section below. 
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2.5 Other ways to justify SOI 

In (Dowker et al., ), an easier way to reach the same conclusion about the PCC is 
given. 

Weak relativistic causality: if A \ B, with A D dom(A) and B D dom(B), 
and A is correlated to B, then there exists an event C such that dom(C) C V\ = 
J~(A) fl J~(B), and A is correlated to C and B is correlated to C. 

This seems to be the weakest statement that could be made along the lines of a 
PCC. However, in view of Simpson's paradox, this principle needs to be true even 
when the probabilities are conditioned on other events in V\. With this strengthening, 
weak relativistic causality can be shown to be equivalent to SOI. 

It is also true that, if SOI holds for some model, then it is possible to introduce an 
Einstein local deterministic hidden variable theory that reproduces the probabilities 
of that model (Fine, 1982) 5 . The converse is also true. (Incidentally, if the hidden 
variable theory is time reversal invariant, then there is also a corresponding principle 
of common effect, which makes it impossible to define the arrow of time by the PCC, 
if such an underlying theory is assumed.) This is another way to get back SOI from 
a set of "reasonable physical assumptions"; as discussed in the next section, it also 
highlights how reasonable and physical those assumptions really are. 

3 Quantum mechanics and dynamical causal struc- 
ture 

It is well known that the above principle, SOI, is violated in standard relativistic 
quantum theory (Bell, 1987, pp. 52-66) (although arguably the corresponding ex- 
periment has not been successfully performed (Percival, 2001)). Correlations arise 
for which SOI fails, even if we concoct "hidden" events in the past. So presumably, 
if the illusionists were perseverant enough, they could manage to produce a strange 
looking correlation, if we consider anything that breaks SOI to be strange. All the 
talk about what is "physically reasonable" seems to have been in vain; SOI is not so 
physical after all. 

There are different attitudes that can be taken to this situation, adapted here from 
the concise list of Butterfield's (1994): you can reject the whole notion of the PCC as a 
fundamental principle; you can see quantum mechanics as indicating that the PCC as 
set out above is inadequate and needs further revision; you can stick to a screening off 

5 The "factoring" condition used by Fine (1982) is actually not quite screening off, but a condi- 
tion that follows from it when some assumptions of independence are made about the settings of 
experimental equipment (a "freedom of choice" assumption). 



16 



principle, but reject relativistic causal structure and say that there is superluminal 
influence (as in Bohmian mechanics); or you might hope that future experimental 
results will be different from the predictions of quantum mechanics (Percival, 2001). 
The following discussion takes the second tack. Firstly, this is because superluminal 
signalling is still ruled out in quantum theory, and the PCC could reasonably be 
equated with relativistic causality, when the latter is defined appropriately. Secondly, 
although screening off fails, there is still a vague sense in which the correlations arose 
as a result of some "cause" in the mutual past of the correlated events (for instance, in 
the classic example, an entanglement is set up between two photons which might be 
seen as the "cause" of the correlation), and so it is reasonable to attempt to formalise 
this notion. But, had quantum mechanics not come along, it is doubtful that anyone 
would have foreseen the need for a generalisation of SOI. So what has gone wrong? 

3.1 Quantum screening off 

First, a decision must be made about how to view causality in quantum mechanics. 
It is usual to claim that superluminal signalling is banned in standard QFT, and this 
is usually held to follow from the commutativity of spacelike operators. This does 
guarantee that some events, such as the performance of a von Neumann measurement, 
will not be noticeable to a conspirator in a spacelike laboratory, in the absence of 
other measurements by third parties. But this is not the end of the story. This 
kind of discussion of superluminal signalling is based on external agents carrying out 
measurements and other operations on a quantum field, in attempts to send spacelike 
signals. Even apart from the fact that naively applying this framework to QFT can 
be shown to allow superluminal signalling (Sorkin, 1993), it seems inappropriate for 
cosmological theories. More preferable to this would be a definition of causality that is 
given in terms of the dynamics of the quantum system, without reference to classical 
observers of any kind. Then the search for a "relativistic causality" principle becomes 
more like the discussion of the PCC that is given above for the stochastic case. For 
more of this argument see (Dowker et al., ). 

If an observer independent, quantum PCC is being sought, then some idea needs 
to be given of how to understand quantum mechanics at all without observers. Most 
appropriate for the present purpose, since we have already begun to think about 
histories, are the various "histories" approaches. The details of the interpretations 
will not be needed here; the formulation of quantum mechanics associated with them 
is more relevant. The central idea is the replacement of the stochastic measure 
/i : £ — ► K used above with a non-additive quantal measure /i q : £ — ► K, obeying 
generalised axioms (for a full introduction to this idea see Sorkin, 1994) . Equivalent 
to this, and sometimes easier to work with, is the decoherence functional D : E x S — > 
C (Hartle, 1992) which has the following properties: 

Hermiticity: D(A, B) = D*(B, A) for all A.BeS 
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Positivity: D(A, A) > for all A e S 



Additivity: D(A U B,C) — D(A, C) + D(B, C) for all A, B e E such that A and 
.B are disjoint. This can be extended to countable sums if necessary. 

Normalisation: D(tt,Vl) = 1 . 

The value of the quantal measure is defined as fi q (A) = D(A, A). A quantal 
process is defined by the triple {jj, q , E, Q}, and all the definitions of events and least 
domains of decidability may be carried over 6 . 

Now there is a potential answer to the question of why SOI is too strong for 
quantum mechanical systems: quantum mechanics is not naturally described as a 
stochastic process. But all of the reasoning in section 2 was based on the assumption 
that a stochastic process defined by the triple {/i, £, Q} can indeed adequately capture 
the dynamics of the universe — in particular, that it can contain all information 
relevant to a principle of common cause. Therefore we need to drop this assumption 
and work with a quantal process instead of a stochastic process. 

Since the interpretation of the quantum measure is not so straightforward as that 
of the probability measure, what is "physically reasonable" and what is not becomes 
shadier here, making arguments for a quantum version of screening off less compelling. 
It is significant that for some events, the fact of whether they occurred or not can 
never be known. Should such events be allowed as the analogues of screening events 
here (can they be "causes")? Should the PCC still hold when they are conditioned 
on? Taking a realistic approach, the most natural (and fruitful) answer seems to be 
yes 7 . 

With suitable definitions of independence of events, and conditioning, a similar 
argument to one mentioned above in section 2.5 can be used to derive a quantum 
version of screening off. The following result is obtained in (Dowker et al., ). 

Quantum screening off (QSO): Let A, A and B, B be events and A D dom(A), 
A D dom(A), B D dom(B) and B D dom(B). If A \\ B, then 

D(AnBnc,AnBnC)D(c,C) = D(Anc, AnO)D(Bnc,BnC) vc,c e$(Vi), 

(43) 

6 This procedure, of denning events only on S and not on some ortholattice as in (Isham & Linden, 
1995), is equivalent to picking a preferred basis (e.g. position of particles) in which to define the 
histories. This is still very much a quantum process, as pairwise interference of alternatives, which 
may be taken to be the defining quality of a quantal process, is still present. 

7 In the decoherence/consistent histories interpretation, the only "knowable" events are those 
that decohere. If only these were considered when formulating a causality principle, we would not 
be any better off than we were in the stochastic case; the measure restricted to decohering events is 
a probability measure. 
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where V\ is the mutual past V\ = J (A) fl J (£>). 

This causality principle is formally true for local, relativistic QFT (Dowker et al., 
), providing good evidence that it is not too strong to be "physically reasonable", 
and reduces to SOI when all events decohere {i.e. when the decoherence functional 
is diagonal). 

3.1.1 Which past for QSO? 

Conditioning on the joint or mutual past (the analogue of the choice between SOI 
and S02) is also equivalent here. A cheap way to prove this is by a reformulation 
of the condition, making it possible to reuse the proof of the stochastic result. The 
decoherence functional ranges over pairs of sets in fl. Thinking of a measure space 
Cl = Q x Q, the decoherence functional can equivalently be represented as a function 
fi from a set of subsets E of Q to the complex numbers. Sets in E, which will 
be called "pseudo-events", are defined as all sets X x Y where X, Y e S, i.e. as 
pairs of events. From the definition of the decoherence functional, the function ft 
has some properties in common with a probability measure: it is additive, so that 
jl(A U B) = ft(A) + £l(B) for disjoint pseudo-events A and B, and normalised to 1. 
However, it is not a probability measure because it is not bounded above or below. 
The least domain of decidability of a pseudo-event is defined by dom p (A x A) — 
dom(A) U dom(A) (the definition for pseudo-events need not have properties (i-iv); 
it is shown in the appendix that this function does have the necessary properties 
to prove useful results). A pseudo-event is called a full specification of 1Z iff it is 
the product of two events that are full specifications of 7Z, and <& P (1Z) is defined 
accordingly as the set of all full specification pseudo-events (this is clearly a partition 
of Cl). In this framework, The QSO condition looks much like SOI: 

A restatement of Quantum screening off (QSOl): For all pseudo-events A 
and B with dom p (A) C A and dom p (5) C B, if A\\ £>, then 

p.{A n b n C)fi(C) = fi(A n c)p.(B n c) vc e %(Vi), (44) 

where V\ is the mutual past V\ = J~{A) fl J~{B). 
There is a joint past alternative to this: 

Another form of Quantum screening off (QS02): For all pseudo-events A 
and B with dom p (A) C A and dom p (5) C B, if A \ B, then 

fi{A n b n C)fi{C) = fi{A n c)ji(B n c) vc e %(Vi), (45) 

where V x is the joint past V x = J~{A) U J~(B)\(A U B). 
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The proof of QSOl QS02 is almost identical to that of SOI S02. The 
properties of the function /}, and of the domain of decidability of pseudo-events, are 
similar enough to the properties of \i and the domain of decidability of events to 
ensure this. However, since jx can be negative, it is now possible for £l(B) to be zero 
while jj,(A fl B) is non-zero, and so (i(A\B), the analogue of fx(A\B), is sometimes 
not well defined. Therefore, for completeness, the proof is repeated for the quantum 
case, without use of such conditional statements, in the appendix. 

3.2 Dynamical causal structure 

QSOl (SOI) is a formal statement of the notion that causal influences should prop- 
agate only within the light-cone, with the assumption that the quantal (stochastic) 
process is an adequate framework for our dynamical theory. But in general relativity, 
and presumably in a successful quantum gravity, the light-cone structure of space- 
time is itself dynamical. How can a PCC be formulated if the regions in which events 
occur have different causal relations in different histories - or worse still, cannot be 
identified as being the same region in different histories? If we also seek to impose the 
general covariance of GR on our theory, how does this affect these questions? At the 
present time this seems to be the greatest problem for the definition of a PCC that 
could be of use in quantum gravity. No solution is proposed below, but the problems 
are pointed out and some existing ideas are discussed. 

3.2.1 The stochastic case 

Before looking at any quantum indeterminism of causal structure, many of the sig- 
nificant problems with indeterministic causal structure can be brought to light by 
considering stochastic processes. Previously, the least domain of decidability of an 
event was defined by fiat on a causal structure S. A deeper description would come 
from considering each history (i.e. point in history space) as a list of values of proper- 
ties attached to regions in the causal structure, like the values of a field on Minkowski 
space M. We are now moving to a theory in which the histories contain the causal 
structure, which can be different in each history (for example, each history could be a 
different Lorentzian manifold). The idea of domain of decidability is lost, as regions 
now have no significance for the whole history space. Is it possible to restore enough 
of the idea to write down a condition similar to SOI or S02? 

In S02, probabilities are conditioned on a full specification of the past set 8 Vi- 
Can this be done when causal structure is stochastic? Firstly, any full specification C 
of V<i would still be an event, a set of histories in S. It would be necessary to identify 
regions in different histories as being "the same region": a region like V2 must be 

8 A past set in causal partial order is denned to be one that contains its own past: a region TZ 
such that K = J~(Tl). 
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be defined on a set of histories containing C. This is where the major conceptual 
problems lie. If we knew how to do this, it would be possible to similarly define past 
sets V2 U A and P 2 U5, and make sure that the regions A and B were spacelike to 
each other. With this done, a principle like S02 could be recovered. So the crucial 
question is: when are two past sets in different histories the same region? 

A special strategy can be used to answer this when the history space consists of 
causal structures in which the points are uniquely labelled in some way {i.e., each 
point in the structure has a property that is shared by no other point in that structure, 
like a set of real numbers). Two past sets in different histories are considered as being 
in the same region if they have the same causal structure, and the same labelling of 
all points in that structure. This is the general idea followed by Rideout and Sorkin 
(2000) to define a causal set dynamics. A causal condition similar to S02 (called 
"labelled Bell causality") is used to constrain a stochastic process on causal sets 
(formalised by Brightwell, Dowker, Garcia, Henson, and Sorkin (2003)), leading to the 
"classical sequential growth" (CSG) model, which is to be understood as preparatory 
work for a full quantum dynamics. 

The problem is that labelling like this threatens the principle of general covariance 
from GR: such "co-ordinatisations" are considered to be unphysical. In the causal 
set dynamics (Rideout & Sorkin, 2000; Brightwell et al., 2003), general covariance 
is introduced as a fundamental principle, the dynamics being defined so that the 
labelling drops out of consideration. But the Bell causality condition is still defined 
in terms of the labels. What is the physical significance of labelled Bell causality 
in a theory that denies labels physical significance? This riddle has not been fully 
answered, and as such work remains to be done on the motivational foundations of 
the CSG model. As a start, this vague conjecture could be explored: that the general 
covariance and labelled Bell causality conditions of CSG imply some kind of generally 
covariant causality condition (as yet undefined). 

It would be more satisfying to have a causality condition that did not rely on 
labelling. Why not just identify a past set in different histories as the same region P 2 
if it has the same causal structure? One problem is that there may be more than one 
past set in a history that has the causal structure associated with P 2 - How are we to 
know which one is "the same region" as an isomorphic past set in another history? 
To get rid of this problem, it could be assumed that there is only one copy of the 
past set. A full specification of a region P 2 could be defined as consisting of histories 
containing only one past set that is isomorphic to a certain causal structure (that 
past set being identified in all such histories as the region V2) ■ 

A number of questions arise. Would such a statement be strong enough? How 
can we know that there is no other past set in the universe identical to our own past? 
Is it even reasonable to identify two past sets from different histories at all? Or is 
it somehow justifiable to neglect these concerns? At this (as yet) ragged edge of the 
concept of causality, the questions are still a little vague, and await better statements 
and answers. 
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3.2.2 Dynamical causal structure and quantum mechanics 

Moving from a stochastic to a quantal process, the problem of dynamical causal 
structure becomes even more virulent. In the stochastic case, we have the problem 
of defining a region P 2 and a concept to replace a full specification in that region. 
Once this is done, spacelike regions A and B can be given some meaning and S02 
resuscitated (see Rideout & Sorkin, 2000, for a concrete example). In the definition 
of QSO, however, we need two full specifications of the same region V2' C and C . 
If the causal structure in the two cases is different, how could this be made to make 
sense? How could we make sure that the events A and A were both defined in the 
same region, spacelike to one in which B and B were defined? Could some trick 
with labelled causal structures work here too? These questions remain to be properly 
explored. 

4 Conclusion 

We have seen how, starting with Reichenbach's principle of common cause, Bell's 
causality principle, here termed screening off (SO), can be reached by a series of 
strongly motivated revisions. Screening off roughly states that two events that occur 
in spacelike regions A and B must be independent when probabilities are conditioned 
on all events to the past of these two regions. It has been shown that, modulo certain 
concerns of measure theory and locality, conditioning on the joint past in SO gives an 
equivalent definition to conditioning on the mutual past. It has also been conjectured 
that further conditioning on any other region spacelike to A and B would result in 
another equivalent definition. With the assumption of stronger locality conditions, 
the Penrose-Percival causality condition is also conjectured to be equivalent. 

Some thoughts on quantum mechanics and the problem it poses for the PCC 
have been given, and the condition of quantum screening off (from Dowker et al., ) 
suggested as a solution. This causality condition is also seen to have two equivalent 
forms, one involving the mutual past and the other the joint past of two regions. 
After this, the problem of indeterministic causal structure was addressed, although 
no concrete conclusions have been drawn on this subject as yet. 

Apart from the conjectures already mentioned, there are many open questions 
relating to the use of the PCC as a fundamental principle of physics. As a begin- 
ning, it would be satisfying to improve the results given in this article, extending 
the framework to non-local events by dropping property (iv) of the least domain of 
dependence, and getting rid of the requirement that there be a finite number of full 
specifications for any given region in the proof of corollary 2. Also, the discussion has 
only made formal use of histories as points in the history space Q; it would be helpful 
to make use of the idea of a history as a set of properties on the causal structure. 
The concept of least domain of decidability could then be expanded so that it directly 
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related to properties of histories instead of being introduced by hand. This might 
be of some use in the definition of the PCC for theories with indeterministic causal 
structure. 

Another closely related concept is that of stochastic Einstein locality (SEL) (Hell- 
man, 1982) (Butterfield, 1994). Some formulations of this causality condition (par- 
ticularly SEL2 in Butterfield, 1994, which includes some discussion relating to the 
Reichenbachian PCC) could be equivalent to the SOI condition. Indeed, SOI is taken 
fairly directly from Bell's local causality (Bell, 1987, pp. 52-66), and Hellman (1982) 
cites Bell's work as the basis for his definition of SEL. It would be interesting to see 
if a link between the framework discussed above and that of SEL could be made, and 
some theorems of equivalence proved. 

The relationship between the SOI condition and deterministic local hidden vari- 
ables, only touched upon above, has significance not only with respect to the main 
motivations of this article, but also to more philosophical uses of the PCC, such as a 
definition of the direction of time. Related to this is the fact that, if we have a theory 
which violates SOI, it may just be because we have failed to introduce the necessary 
common causes into our history space. The idea of an extension of a history space to 
include new events has not been dealt with in this article. Startlingly, analogues of 
Fine's results for stochastic processes (Fine, 1982) are suggested by new work on the 
quantal process. Some of the new results (Craig et al., ) can be taken as examples 
of a conjecture: that, just as a SOI obeying stochastic process can be consistently 
extended so that the history space contains only solutions of a deterministic causal 
("Einstein local") theory, a quantal process obeying QSOl can be similarly extended 
9 . This would give a puzzling new role to local hidden variable theories in quantum 
mechanics, without challenging the Bell theorems. As Einstein locality is understood 
for deterministic, dynamical spacetimes in GR, it is possible that these observations 
could help to define the PCC in stochastic and quantal theories with indeterministic 
causal structure. This subject will be investigated in future work. 
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Appendix: Proof of QSOl QS02 



Firstly, the definition of the least domain of decidability for pseudo events, dom p (A 1 x 
A 2 ) = dom(Ai) U dom(A 2 ) must be examined. The following is to be compared to 
property (i) of dom. 

Lemma 5 For all X, Y G S, 

dom p (X) n dom p (F) = =>• dom p (X n Y) C dom p (X) U dom p (F). 

Proof: Assume dom p (X) n dom p (y) = for two pseudo-events X and Y. Let 
X = X a x X& and Y = Y a x y,. From the definition of dom p we have that dom(Xj) C 
dom p (X) and dom(y) C dom p (F) for i G {a, b}. From property (i) of dom this gives 
dom(X i nF i ) C dom(X)Udom(F) for i,j G {a, b}. This implies that dom p (XnF) C 
dom p (X) U dom p (F). □ 

An analogue of corollary 1 will also be needed. 

Lemma 6 if 71 = for some finite set of regions {A{\, then a pseudo-event 

full specification F of 71 can be written F = f] { Ai where A^ is a pseudo-event full 
specification of Ai. 

Proof: As stated in the main text, a pseudo-event like F is a full specification of 
7Z iff it is a product of two event full specifications of that region. In other words, 
F = G x if, where G and H are full specifications of 7Z. From corollary 1 we have 
that G — Q i Bi where F>i is a full specification of the region Ai, and that H = Hi 
where Cj is also a full specification of the region Ai. This means that 



where Ai = Bi x d. Since Ai is a pseudo-event full specification of Ai, this proves 
the lemma. □ 

The following is the quantal analogue of lemma 4. 
Lemma 7 If, for some pseudo-event P\, 



jl{Ar)Y n n x n Pi) = ii(AnBnx nY nPi)/i(Pi) , and (47) 




(46) 



VXG$ P (^),FG$ P (^), 



n y n Pi)ft(x n Pi) = n x n y n Pi)/i(Pi) , and 
/t(y n Pi)/i(P n x n Pi) = /t(P n x n y n Pi)/t(Pi) , and 
A(y n Pi)/i(x n p) = /i(x n y n p)A(P) 



(48) 
(49) 
(50) 
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then 



VXe%(x),Ye%(y), 
fi{A n x n y n Pi)/i(P ninvn Pi) = fi(A nsninyn n y n Pi). 

(51) 

Proof: Substituting (47) into (50) gives 

fi(AnYn Pi)/t(P nxn Pi)/i(x n Pi)/i(y n Pi) = 

fi(A n p n x n y n Pi)A(x n y n Pi)/i(Pi) 2 

vxe$ p (4ye$ p (y), (52) 

Substituting from (48,49) into the LHS, we have 

fi(A n x n y n Pi)/2(p nxnynPi) = /i(AnPnxnyn Pi)/i(x n Y n Pi), 

(53) 

which is equivalent to (51). □ 
Claim 3 QSOl implies QS02. 



Proof: Assume QSOl. Take any pair of pseudo-events A and P, such that dom p (A) C 
A and dom p (P) C £> and A t| £>. The regions Pi, P 2 , A" and y are defined as in the 
proof of claim 1. 

Consider the pairs of events {(A n X), (P n Y)}, {(A n X),y}, {A, (P n y)} 
and {X, y}, where X is a pseudo-event full specification of the region X, and Y 
is a pseudo-event full specification of y. To apply QSOl to these pairs we need to 
establish that the members of each pair occur in spacelike regions, and find the mutual 
past of those regions. Recall that this full specification X e <& P (X) is defined so that 
dom p (X) C X, and since A is disjoint to X, dom p (AnX) C AUX, for all X G $ P (X), 
from lemma 5 (similarly for B and Y). It is also the case that dom p (X) C A U X 
and dom p (y) C B U y. From this we can see that for all four pairs of pseudo-events, 
the two pseudo-events occur in the pair of regions {(^4 U X), (£> U y)} respectively. 
As noted before, it can be seen that (A U X) t| (B U y) , and that the mutual past for 
this pair of regions is Pi. Thus QSOl can be applied four times, each time using a 
pseudo-event full specification C of the past region V\. QSOl therefore gives: 



VX G $ P (X), Y G %{y),C G $ p (Pi) 

fi(A n y n c)ji(B nxnC)= / u(AnPnxnyn c)/i(C) , and (54) 

fj,(A n y n c)/t(x n c) = n x n y n c)/i(C) , and (55) 

ji(Y n c)/2(s n x n C) = ^(P n x n y n c)fi(C) , and (56) 

/i(y nc)/i(xnc) =/i(xny nc)/t(c). (57) 
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From lemma 7, this implies that 

VX e%{X),Y e%(y),C e^iVJ 
fi(A n x n y n c)/i(s ninynC) = /i(Anflninyn c)ji(C). (58) 

From lemma 6, if C G %(Vi), X G %{X) and F G then CnXnF G $ P (P 2 )- 

Eqn. (58) is therefore equivalent to (45) and so QS02 is satisfied for the pair of 
pseudo-events {A,B}. □ 

Claim 4 QS02 implies QS01. 

Proof: As above, take any pair of pseudo-events A and B, such that dom p (A) C 
.4 and dom p (_B) C B and A \ B. We only need note that, if dom p (A) C ^4 and 
dom p (_B) C £> ,then it is also true that dom p (A) C AUX and dom p (_B) C i3U^, where 
and y are defined as before. The joint past of the pair of regions {^4 U X , B U y} 
is Vi. Since AL)X\\Buy, QS02 gives 

n B\C) — jl(A\C)jl(B\C) VCG$ p (Pi). (59) 

From this, we know that for any pseudo-events dom p (A) C A and dom p (_B) C B, if 
A \\ B, then the above equation holds. This is exactly QS01. □ 

Corollary 3 QSOl is equivalent to QS02. 
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